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^^0 ' In the present paper, we prove that with probabihty one, the Stratonovich's sigature of a 

multidimensional diffusion process (possibly degenerate) over [0, 1], which is the collection of all 
, ■ iterated Stratonovich's integrals of the diffusion process over [0, 1], determines the diffusion paths. 
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^ ■ 1 Introduction 
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■ ■ Let Xt be an R''- valued continuous path over [0, 1] with bounded variation (d ^ 2). According to [7], 

[S], for ^ s < t ^ 1, we can define the sequence of iterated integrals 
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% 



Xs,t — (1, Xgj., Xg f, • • • , Xgf, ■ ■ ■), 



1/-^ . where 

00 



Xsj^ dX^,®---®dX^^, n^l. (1.1) 

-/ s<ui <---<'u^<i 



X"( is regarded as an element in the tensor space (E'*)'*" = M"'' and ~Ks,t is hence an element in the 
^^ . tensor algebra 



Xg^t is multiplicative in the sense that it satisfies the following Chen's identity: 

Xs,t = Xs,„(8)X„^t, Os^s<u<is^l. 

In other words, X^^f is uniquely determined by the original path Xt] or intuitively speaking, the original 
path Xt contains all information about its differential dXt . A remarkable consequence is that a theory 
of integration along Xt can be established in the sense of Riemann-Stieltjes, which leads to a theory 
of differential equations driven by Xt . Such a theory for paths with bounded variation is classical and 
well-studied. 
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If the path Xt is less regular, for example, Xt has finite p- variation for some p > 1, it may not 
be possible to establish an integration theory along Xt by using the information of the original path 
only. The fundamental reason is that the path Xt itself does not reveal enough information on its 
differential dXt , which is essential to be fully understood if we want to develop an integration theory 
along Xt. As pointed out by T. Lyons in [8], for this purpose, together with the path itself, a finite 
sequence of iterated integrals up to level [p] satisfying Chen's identity should be specified in advance. 
Such a finite sequence of iterated integrals 

Xs,t = (l,Xj t, • • • ,Xgt) 

is regarded as a multiplicative functional X from the triangle region A = {{s,t) : ^ s ^ i ^ 1} to 
the truncated tensor algebra 

r(W)(R'^) = ®|fIoIR"''. 

X is called a rough path with roughness p. According to [8], X extends uniquely to a multiplicative 
functional from A to T'--°°'> {R'^). In the founding work of T. Lyons in [5], a general theory of integration 
and differential equations for rough paths was established. 

For a rough path X with roughness p, the signature of X is defined as the formal sequence 

^(X) — Xo,i = (1, ATq 1, • • • ,-'^0,1' " ' )' 

where for n > [p], X"t is the unique extension of X as previous. The signature S'(X), proposed by 
K.T. Chen in [5] and T. Lyons in [8j, can be regarded as the collection of overall information of any 
arbitrary level n about the rough path X. It is of central interest and conjectured in the theory of 
rough paths that the signature S'(X) contains sufficient information to recover the path X completely. 
In the groundbreaking paper [i3j by B. Hambly and T. Lyons, they proved that for a path Xt with 
bounded variation, the signature of Xt uniquely determines the path up to a tree-like equivalence. 
However, for paths with unbounded variation, very few results are available and it remains a lot of 
work to do. 

In the work [6J by Y. Le Jan and Z. Qian, they considered the case of multidimensional Brownian 
motion and proved that for almost surely, the Brownian paths can be recovered by using the so-called 
Stratonovich's signature, which is defined via iterated Stratonovich's integrals of arbitrary orders. Since 
the Brownian paths are of unbounded variation and can be regarded as rough paths with roughness 
p G (2,3), we may need to specify the second level in order to make sense in terms of rough paths. 
However, according to [5], [1^, there is a natural lifting of the Brownian paths to the second level by 
using dyadic approximations, which is called the Levy's stochastic area process and it coincides exactly 
with the iterated Stratonovich's integral defined in the same way as (jl.ip . Such a lifting is determined 
by the Brownian paths itself, and in [6 1 when regarding the Brownian motion as rough paths they were 
actually using this lifting. Consequently, in |6j the recovery of the Brownian motion as rough paths is 
essentially the recovery of the Brownian paths in terms of Stratonovich's signature. 

In the present paper, we are going to generalize the result of Y. Le Jan and Z. Qian in [6J to the 
case of multidimensional diffusion processes (possibly degenerate) . The idea of the proof is simlar to 
the case of Brownian motion, in which they used a specially designed approximation scheme and chose 
special differential 1-forms to define the so-called extended Stratonovich's signatures to recover the 
Brownian paths. However, there are several difficulties in the case of diffusion processes. Firstly, we 
need quantitative estimates for rare events of diffusion processes to prove a convergence result similar 
to the Brownian motion case. In [6], they used the symmetry and explicit distribution of Brownian 



motion, which are not available in the case of diffusion processes and hence we need to proceed in 
a different way. Secondly, to construct special differential 1-forms, they used a rather special case 
of Hormander's theorem to ensure existence of smooth density, in which the so-called Hormander's 
condition was easily verified. In the case of diffusion processes, the construction of differential 1- 
forms is more complicated in order to ensure hypoellipticity. Lastly, in the Brownian motion case, the 
Laplace operator is well-posed so that PDE methods was applied in [3] to obtain a crucial estimate 
which enables us to relate the extended Stratonovich's signatures to the Brownian paths. However, 
for a general diffusion process, the generator L may not be well-posed any more (we don't impose 
uniform ellipticity assumption on L) and PDE methods may no longer apply (in fact, to ensure the 
application of PDE methods, rather technical assumptions should be imposed on the operator L and 
the domain of solutions if without uniform ellipticity). Therefore, we need another approach to recover 
the diffusion paths by using extended Stratonovich's signatures. 

2 Main result and idea of the proof 

Li this section, we are going to formulate our main result in a rigorous way and illustrate the idea of 
the proof. 

Let (n, J-", P) be a complete probability space and let Wt be a d-dimensional Brownian motion on 
J7. We consider an A^-dimensional [N ^ 2) degenerate diffusion process Xt driven by Wt, defined by 
the following SDE in the geometric formulation: 

d 

dXt = ^14(X0odWf + Fo(XOdt (2.1) 

a=l 

with Xq = 0. 

Here we make the following assumptions on the generating vector fields {Vi,- ■ ■ , V^; Vb}. 

(a)K),Fi,---,FdeCr(M^); 

(b) for any x G M^, the Hormander's condition (see [4]) holds at x in the sense that: the countably 
many vectors 

[Vo^.Vp], a,/3 = 0,l,--- ,d; 
[V^,[Vp,V^]i a,A7 = 0,l,---,d; 



generate the tangent space T^SJ^ = K^, where [•, •] denotes the Lie bracket; 

(c) there exists a positive orthonormal basis {ei, • • • , cm} of R^, such that for any x G M^ and 
i = 1, 2, • • • , iV, Va{x) is not perpendicular to e^ for some a = 1, 2, • • • , d. 

Remark 2.1. Assumptions (a) and (b) are made to ensure the hypoellipticity of the generator 






of the diffusion process (|2.ip . Assumtion (c) is made to ensure the escape condition and the outer 
normal condition proposed in [1] on some domain of interest which is of cube shape and wih be chosen 
relatively small. With these assumptions, we are able to apply results in [Ij to obtain the existence of 
continuous density function of the Poisson kernel for the domain of interest and a quantitative estimate 
on the density function, which are both crucial in the proof of our main result. 

It should be pointed out that if the diffusion process (|2.ip is nondegenerate, that is, if the gen- 
erating vector fields {Vi, • • • , y^; Vb} satisfy the following ellipticity condition: for any x S M^, the 
vectors Vi(x), • • • , Vd{x) generate the tangent space Tj-M^ ^ K^, then assumptions (a), (b), (c) are 
automatically verified. 

For n > 1, ji, • • • , j„ e {1, 2, • • • , N}, define the iterated Stratonovich's integral of order n: 
[ji , • • • , j„],,t = [ odXll o dXll o . . . o dXl: , O^sKt^l. 

Js<ti<-<t„<t 

Alternatively, [ji, ■ ■ ■ ,jn]s,t can be defined inductively by the following relation: 



b'l, ■ • • , jn]s.t = / [jl, • • ■ , jn-l]s,u O dXl" , S^ S < t S$ 1, 

J s<u<t 

where [jij^.t is defined to be 



[jiUt = / odXi' =Xl'-Xl\ < s < t sC 1. 

J s<u<t 

For convenience, if n = 0, we denote [ji, • • • ,jn]s,t = 1- The family 

{[jl,--- ,in]o,l : Jl,--- ,jn e {1,2,--- ,7V}, 71 ^ 0} 

of iterated Stratonovich's integrals is called the Stratonovich's signature of Xt over [0, 1]. 

Let Fi be the completion of the cr-algebra generated by the diffusion process Xt over [0, 1], and 
let Qi be the completion of the a-algebra generated by the Stratonovich's signature of Xt over [0, 1]. 
More precisely. 



Fi = o{Xt : s$ t s^ 1), 



Qi = ct({[.?i,--- ,Jn]o,i : Jir-- ,i«e{l,2,--- ,iV},n^O}). 

In Y. Le Jan and Z. Qian's paper [6J, they formulate the problem for Brownian motion in the following 
way: 

Fi - Gi- 

The result formulated in such a way for the diffusion process in our setting can be proved in the present 
paper. However, we are going to present a more illustrative formulation of the problem, which to some 
extend reveals how we can reconstruct the diffusion paths by using the Stratonovich's signature over 
[0, 1] in a conceivable way. 

First we need the following definition. 

Definition 2.1. A piecewise linear trajectory (P.L.T.) T in M.^ is a finite sequence of points in M^ 
(not necessarily all distinct). Here we always assume that the number of points in T is greater than 
one (if T consists of only one point x, we will regard Tas the finite sequence (x, x)). For a P.L.T. T in 
]S.^ , the number of points in Twill be denoted by |T|. If the points of T belongs to a subset F C M^, 
we say that T is a P.L.T. in F. 



The reason why we use the word "piecewise hnear trajectory" is that when given T, we actually 
think of T as a piecewise linear graph by connecting the points in T by line segments in order. Here 
we should point out that the order of points in T is rather important, and no parametrizations are 
involved. 

Definition 2.2. For n ^ 2, a parametrization a of order n is a partition of the time interval [0, 1] into 
n — 1 nontrivial subintervals: 

a : = ti < t2 < ■ ■ ■ < i„-i < <„ = 1. 

The space of all parametrizations of order n will be denoted by Vn- 

Let T be a P.L.T. in M^ and let cr be a parametrization of order |T|. The piecewise linear path over 
[0, 1] defined by applying linear interpolation of T along the parametrization a is denoted by T{t\a). 

Our formulation of the problem is related to a kind of convergence which is parametrization free. 
Therefore, we need the following definition of convergence in trajectory. 

Definition 2.3. Let (74)0^*^1 be a continuous path in M^. A sequence {T'-"-'} of P.L.T. s is said to 
be converging in trajectory to (7t)o^«^i if 

lim inf sup |7t-r^"'(i|CT)| -0. 

Now we are in a position to state our main result. 

Theorem 2.1. Let Z be the space of P.L.T. s in Z^ equipped with the discrete a-algehra. Then there 
exists a sequence {T'"'} of Z -valued Qi-measurable random variables (random P.L.T.s), such that 
with probability one, — • T"*-"^ converges in trajectory to the diffusion paths (^t)o^t^i- 

It seems that the statement of Theorem 12. II does not contain much information about the approx- 
imating sequence {T'"'}. However, when going into the proof of Theorem l2.1l (Section 3), we will see 
that 7"'"^ is constructed in a quite explicit way. 

A direct consequence of Theorem 12. H is the result based on Y. Le Jan and Z. Qian's formulation. 

Theorem 2.2. J"i = ^1. 

Before proving our main result Theorem 12.11 in the next section, we first illustrate the idea and 
main steps of the proof. 

We adopt the scheme and the key observation that the diffusion paths can be recovered by reading 
out the maximal seqence of well-chosen compactly supported differential 1-forms such that the iterated 
Stratonovich's integral of those 1-forms (extended Stratonovich's signature) along the diffusion paths 
over the duration of visit of their supports is nonzero , which were proposed by Y. Le Jan and Z. Qian 
in [6]. 

The idea of the proof of Theorem 12. II is the following. 

Firstly, divide the whole space M^ into disjoint small boxes and narrow tunnels. By recording 
the successive visit times of those small boxes, we can construct a piecewise linear approximation of 
the diffusion paths. A convergence theorem can be proved by developing certain types of estimates 
of rare events for the diffusion process. By enlarging the size of those small boxes a little bit (by a 
higher order infinitesimal relative to the size of boxes), we can similarly get another piecewise linear 
approximation also converging to the diffusion paths as the size of boxes goes to zero. Secondly, we 



construct a family of "special" differential 1-forms on M.^ (depending on the size of boxes) in a way 
that for any larger box, we construct a 1-form supported in it such that it is highly non-degenerate 
on the inner smaller box. The crucial observation is that the Stratonovich's integral of any of those 
1-forms along the diffusion paths over the duration of visit of its support is nonzero. It turns out 
that for a diffusion path, we can read out an associated unique maximal finite sequence of 1-forms (a 
P.L.T.) recording a sequence of boxes in order such that the iterated Stratonovich's integral of this 
sequence of 1-forms (extended Stratonovich's signature) along the diffusion path over the duration of 
visit of their supports is nonzero. It provides us with sufficient information to recover the diffusion 
path by taking limit in a reasonable way. This is due to the fact that based on our construction, we 
can prove that such a maximal sequence always "lies" between the two piecewise linear approximations 
constructed before, both of which converge to the diffusion path. Here we need to develop a kind of 
squeeze theorem for the type of convergence (convergence in trajectory in the setting of P.L.T. s defined 
as before) in our situation. 

To carry out the above idea, we are going to establish the following three steps. 

(1) Step one: proving a convergence result for the piecewise linear approximation based on succes- 
sive visit times of small boxes. 

The proof consists of two ingredients. The first one is a probabilistic estimate of the number of 
boxes visited over the time duration [0, 1], which can be developed by using a random time change 
technique. It turns out that we can reduce to the Brownian motion case. The importance of such an 
estimate is that we can get an asymptotic rate of the probability that the number of boxes visited 
over [0, 1] is quite large. The second one is the probabilistic estimate of the uniform distance between 
the piecewise linear approximation path and the original diffusion path, provided the number of boxes 
visited over [0, 1] is fixed. This can be done by using the Strong Markov property and a quantitative 
result in [1] by G. Ben Arous, S. Kusuoka and D.W. Stroock, which gives us control on the density of 
the Poisson kernel of a given bounded domain in E^ and enables us to estimate the probability that 
the diffusion process travels through narrow tunnels. Combining the two ingredients, it is not hard to 
prove the convergence result by using the Borel-Cantelli's lemma via a subsequence. 

(2) Step two: constructing differential 1-forms and using extended Stratonovich's signatures. 

For any larger box, we are going to construct a suitable differential 1-form supported in it and 
highly nondegenerate on the inner smaller box. The construction of such a differential 1-form can be 
reduced to the construction of a differential 1-form such that the generator of some associated SDE with 
dimension A'^ -|- 1 is hypoelliptic on the support of the differential 1-form. The family of diffenrential 
1-forms constructed in such a way will be used to construct extended Stratonovich's signatures, which 
in turn will be used to recover the diffusion paths as stated in the idea of the proof. 

(3) Step three: proving a squeeze theorem for convergence in trajectory to recover the diffusion 
paths. 

From the above two steps we constructed two sequences of piecewise linear approximations of the 
diffusion paths, and between which a sequence of P.L.T. s in terms of extended Stratonovich signatures. 
We will formulate the term "lying bewteen" in a rigorous way in the setting of P.L.T. s and prove a 
squeeze theorem for convergence in trajectory which fits our situation. Here the squeeze theorem we 
are going to prove is not in the most general case (we need to make use of special parametrizations), 
so we need to modify the piecewise linear approximation associated to larger boxes to fit our case. 

An advantage of using such a squeeze theorem is that we can get around the estimates based on 
potential theory and partial differential equations, which was used by Le Jan and Qian in fB] for the 
Laplace operator. In fact, for a general elliptic operator L, the associated partial differential equation 
may not be well-posed and the conditions to ensure a (regular) probabilistic representation of a solution 



is quite restrictive and technical. 

3 Proof of the main result 

In this section, we will give the detailed and rigorous proof of our main result Theorem 12.11 
Recall that {Xt : t ^ 0} is an A^-dimensional diffusion process defined by the following SDE: 

d 

dXt = Yl ^"(^*) ° '^^t" + VoiXt)dt 

a=l 

with Xq = 0, in which the generating vector fields satisfy assumptions (a), (b), (c). 

From now on, without loss of generality, we will assume that in assumption (c) the positive orthog- 
onal basis {ei, • • • ,6^} of E^ is taken to be the canonical basis. In fact, if it is not the case, then 
the constructions in the sequel should be made under the frame {ei, • • • , cn} instead of the canonical 
basis of M^ (for instance, the faces of a box constructed below should be parallel to the hyperplanes 
perpendicular to those e^'s). 

3.1 Discretization and an approximation result 

Similar to the idea of Y. Le Jan and Z. Qian, we first construct a suitable approximation scheme for 
the diffusion paths. 

For convenience, a constant is called universal if it depends only on the generator L and the 
dimensions N, d. Moreover, sometimes we may use the same notation to denote universal constants 
coming out from estimates, although they may be different from line to line. 

Let < e < 1. For z = (zi, • • • , ZTv) e Z^, let i/| be the iV-cube in M^ defined by 

i/f = {{xi,--- ,xn) ■■ £z, — s^ Xi ^ ez,-\ — , i = 1,2,--- ,N}, 

where /i is some universal constant to be chosen later on. For technical reasons we assume that each of 
the 2^ corners of H^ is smoothed to the order of e^'', in the sense that if wi, • • • , ^2" a-re the vortexes 
of iJ| and H^ is the associated cube with smoothed corners, then 

2" ^ 2" 

where _B(wj, e^^) is denoted as the ball centered at Vj with radius e^'^. Such a smoothing procedure 
can be done by using standard mollifiers. In the case oi N = 2, we can simply replace each corner of 
iJ| by a quater of some small circle to obtain a square H^ with rounded corners. For simplicity, in the 
following we will use the same notation HI to denote the cube with smoothed corners. The space M^ 
is then divided into disjoint small boxes and narrow tunnels. 

Now we are going to construct an approximation of diffusion paths Xt over the time duration [0, 1]. 

Let Tq = and mp — (0, • • • ,0). For fc ^ 1, define 

Tl = inf{t > t|_i : Xte U HI). 



If t| < oo, define m| be the integer point in Z^ such that Xr' E H^c ; if t| = oo, define m| = m'j._^. 
Intuitively, the sequence of hitting times {t|}^o records the successive visit times of the small boxes 
and the sequence of integer points {ti|}^q records the boxes visited by the diffusion paths in order 
(revisit of the same box before visiting other boxes doesn't count). Note that it is possible that 
P(t| = oo) > since with positive probability the process can always stay in narrow tunnels after 
leaving some box. 

Let Mjj be the number of boxes visited by the diffusion paths over the time duration [0, 1]. Formally, 

Mjj = inf{fc ^ : rf+i > 1}. 

It follows from uniform continuity of the diffusion paths over [0, 1] that Af|f < co for almost surely. 
By a standard random time change argument, we can prove the following. 



Lemma 3.1. Let C = niax{|| Vi||oo, • ' ' , ||Vd||oo, |1K) + 5 Ea=i ^^c^alloo}- Then for any k> ^, 

P{MIj == fc) < 4A^fce"577I^. 



Proof. For fc J^ 1, it is obvious that 

P{Mfj = fc) = P(r| < 1, t|+i > 1) 
k 
^ n[j{r!-rU^-, rf ^1}) 
1=1 

k 

1=1 

k 

where the last inequality comes from the fact that the distance between two different boxes is bounded 
from below by e^ . 

To carry on, by the strong Markov property, it suffices to estimate 

P( sup \Xt - x| > e''), 

where Xt is the diffusion process defined by (|2.ip starting at x G M^. By rewriting (|2.ip in the sense 
of Ito, we have 

UXt - Ea=l Vo.{Xt)dW^ + %{Xt)dt, 

where Vo = Vq + \ YI^i Vv.K.- 



It follows that for fc > 2£ we have 



P( sup \Xt - X| ^ £^) 



=P( sup I V / V^{Xs)dWf+ f Vo{Xs)ds\ ^e'') 
<P( sup \J2 f V„{Xs)dW^\ >e^~^) 
^P( sup if^ / V^{X,)dW^\^'-^) 

JV d t n 

<^P( sup 1^/ Vi{XMW:\>-^). 



Let 



then 



d „t 






d rt 






Smce a local martingale is a time-changed Brownian motion (see [S], Chapter Two, Theorem 7.2'), 
there exists a Brownian motion Bl (probably on an extended probability space but it doesn't matter 
since we only care about probability distributions in our situation), such that 

Consequently, 



P( sup lX,-xl^e'-)iY.P( sip W\»ir7^'> = T,''( ™P IB(M.>,l>::r7s) 



Af ,, N 



<^P( sup |p;|^ )^25:P( sup i?:>-7=) 

~^ 2vjV ief'/(2Vlv) y'2TrdC''/k 



OO -1 



1»,2 



By the inequality 



=4A^ / ^=6-52/ dy. 

2v^V tiC2 



1 _lf2 , _i^2 

e a": rft < e 2^ X > 0, 



27r 



we arrive at 

P( sup \Xt-x\^ e'') ^4:Ne~^^. 

Therefore, we obtain that 

P{MIj = k) i^4Nke'^^^^^c^, k> — , 

which completes the proof. D 

Now we define polygonal approximations of the diffusion paths through succesive visit of those 
boxes. More precisely, if Mf^ = 0, define X^ = (0, 0, • • • ,0) on [0, f]; otherwise for f ^ fc ^ ^h^ define 

and on [tm' , 1], define X^ = em\,jE . Then we have the following convergence result. The proof is 
developed for arbitrary A^ ^ 2, but in the case of iV = 2 the idea is easier to visualize. 

Proposition 3.1. There exists a subsequence e„ — > 0, such that with probability one, (Xj^")o^t^i 
converges uniformly to the diffusion paths (Xt)o^t^i on [0, 1] as n ^- oo. 

Proof. We aim at estimating the following probability 

P( sup \X^-Xt\>Xe), 

where A is a large universal constant to be chosen later on. For convenience, we will assume that j^ 
is a positive integer. 

For this purpose, let fc be a large integer to be chosen later on (may depend on e). It follows that 

P{ sup \XI-Xt\>Xe) 



s; ^ P( sup \X^ - Xt\ > Ae, A'Ph = + PiMf^ > k) 



^ E ^( U { sup \XI -Xt\> Ae, Mjj - 1} U{ sup \Xl ~Xt\> Ae, M^ - I}) 
1=0 j = l -/-i<*^-/ -f<*^i 

+ P{MIj > k) 

k I 

< V[V P( sup \X't -Xt\> Ae, rj sC 1) + P( sup \X^ - Xt\ > Ae, M^ = /)] 
1=0 1 = 1 -/-i<*^-/ -f ^*^i 

+ P(M|f >fc). 

We first estimate P(sup e <.t<T^ \^t ~ ^t\ > Ae, rf ^ 1). The idea is the following: the event 
{sup_e <t<T^ \^t ^ ^t\ > Ae, T^ ^ 1} implies that after time r? , , the process must have travelled 
through many narrow tunnels and spreaded far away from HL^e by many boxes before visiting 
another box. Define ao to be the first time after t? j^ that the process arrives at the entrance of some 

10 



narrow tunnel which is far away from H^e with distance at least g-e without hitting any other boxes. 

For 1 ^ L ^ -^/IS, define (tl to be the first time after ctl-i that the process travels through a narrow 
tunnel without hitting any boxes other than HL,e (define ctq = oo if there is no such arrival and 
(Ti = oo if there is no such travel through). It is easy to see that 

{ sup \XI -Xt\> Ae, t' ^1} c{(Jo <(Ti<--- < aA/12 < oo}. 

Thus it suffices to estimate P{(7o < ai < ■ ■ ■ < crA/12 < 00). This can be done by using the strong 
Markov property and a quantitative estimate for the Poisson kernel of some nice domain in [IJ. In 
fact, by the strong Markov property. 

F(cto < (Tl < • • • < crA/12 < 00) 
=E[P{(To <(Ji < ■■■ < fTA/12 < oo|J"^^^^_J, cta/12-1 < 00] 
^^[p^-vi2-i('")({c^' : u' e T{u)}), aa{u) < ai{u) <■■■< aA/i2-i(w) < 00], 

where T{lu) denotes the set of sample paths co' of the diffusion process starting at X^^,-^^_-^(uj) such 
that the first time of travelling through a narrow tunnel without hitting any boxes is finite. By the 
assumptions on the generating vector fields, the generator L and those small boxes H^ verify the 
conditions of Lemma 2.6 in [Ij. It follows from the lemma that the Poisson kernel H{x,drj) of any 
small box has a continuous density h[x, ry) with respect to the normalizd surface measure drj. Moreover, 
there are universal constants (in particular, not depending on e) Kq, vq > 0, such that 

\h{x,rj)\ «; is:o-dist(a;,aG)/dist(a;,r/)'^«, 

for any x in the box and 77 on the boundary. Since travelling through narrow tunnels implies es- 
caping through narrow windows of the boundary of some associated domain, it follows that on 
{u : CTa/i2-i(w) < 00}, 



P^-A/i2-iM({^'. uj' e T{uj)}) i^ K{e - e^' - 2e^''y-''° • — , 



for some universal constant K > 0. Now it is clear that if we choose /u to be universal and far greater 
than i/Q, then on {a; : cr\/i2-i{uj) < 00}, we have 

p^-A/i2-i(")({^' . ^' g T{uj)}) ^ Ke^-"", 

for some universal constant K > 0. Therefore, 

P(cto <ai <■■■ < ax/12 < 00) ^ Ke''^''°P{cro < cti < • • • < ax/12-1 < 00). 

By induction, it is immediate that 

P{aa <ai < ■■■ < ax/12 < 00) ^ R-^ e^'^>'~''°'> P(ao < 00) 

Therefore, we arrive at 

P( sup \XI ~ Xt\ > Ae, T^ s^ 1) s^ KT2 £T2it^-^o) . 
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The estimate of P(sup^e<f<]^ \Xf ~ Xt\ > Ae, Alfj = I) is exactly the same as above since on 
{Mjj = I}, there will be no visit of boxes other than H^c during [rf , 1]. 
Now consider P{Mfj > fc)- By Lemma EUl if fc > ^, 



OO 

P{Mjj >k) = Y. P^^'^H = I) 

l=k+l 

OO 

^ 2^ ANle sjvdc^ 



^ 






Ce2M\2 1 _ „-Ce2M 



(l-e-C^^'^)2 1-e 



where C — „ „\„i, . Choose a universal constant 7 >> 2/i, and let fc = [4;7] (when e is small, the 
condition k > ^ in Lemma [XT] is satisfied). It follows that 



P{MIj >k)^C'{ ^ "^"^ 



-Ce'^i^\2 f7 1 _ p-Ce^f" ^' 



(l-e-f^^'")2 eTf 



e 



where C" is a positive universal constant. 

Combining with the estimates before, we arrive at 

P( sup \X^~Xt\ > Ae) 



^C"(i^A£A(''-'^°)-27 + ^ . + — ^ . ). (3.1) 



(l-e-C^^'")2 
Finally, choose a positive universal integer A such that 

247 + 24 



1 e^ 


c 


e^l- 


g-Ce^M 



A> 



M~ ^0 



and T^ is a positive integer. By taking £„ = 1/n, we have 

00 
y P{ sup \X'r - Xt\ > Ae„) < 00. 

The Borel-Cantelli's lemma then yields the desired result. 

Now the proof is complete. D 

Let /i' be another universal constant greater than fi. Define (V^/j^f ,'n.|, My,X^) in the same way 
as {H^,T^,ml,,Mfj,X^) only with /i replaced by fj,', then Proposition 13.11 is also true for X'^ (with 
£„ = i as in the proof of Proposition 13. ip . 

To complete the proof, it remains to trace the diffusion paths via extended Stratonovich's signatures 
"between" smaller boxes 7?f and larger boxes V^ , and prove a squeeze theorem so that we are able to 
pass to the same limit Xt. 
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3.2 Constructing differential 1-forms and using extended Stratonovich's 
signatures 

To trace the diffusion paths by using extended Stratonovich's signatures, we first need to construct 
suitable compactly supported differential 1-forms, such that the Stratonovich's integral of any such 
1-form (f) along the diffusion paths over the duration of visit of supp is nonzero with probability one. 
To this end, it suffices to construct a suitable differential 1-form (f) on R^ with compact support 
such that the family of vector fields on M^+^: 

{( ^' \ ■■■ ( ""' Vf ^' V 



satisfies the Hormander's condition on (supp (/>) x M} so that the generator of the diffusion process on 
M^+i defined by 

[dXt = ELi Vo^iXt) o dWt + VQ{Xt)dt, 

\dX^+^^(^{Xt)odXu 

is hypoelliptic on (supp </>) x R^, which ensures the existence of smooth probability densities of certain 
Wiener functionals. Here and thereafter we use the geometric notation for convenience (so Va is a 
regarded as a column vector and </> is regarded as a row vector in M^). In fact, if this is possible, 
then we can proceed in the same way as Lemma 2.1, Lemma 2.2 and Lemma 2.3 in [3] to show 
that the Stratonovich's integral of </> along the diffusion paths over the duration of visit of supp is 
nonzero with probability one, since starting from this point the proof relies only on the strong Markov 
property and again the results in p] , which hold true from our assumptions on the generating vector 
fields {Vi,--- ,yd;Vo}. 

Now to make it more precise, for z G Z^ and e > 0, we are interested in constructing a differential 
1-form (t>1 such that 

HI C (supp <i>iy C supp <i>l C (F/)°, 

and 4>% has the property mentioned before on supp 0|. 
The following result gives the desired construction. 

Proposition 3.2. Assume that the family of vector field {Vi, • • • , V^; Vo} satisfies the Hormander's 
condition at every point x in M.^ . Let G he a hounded domain in M.^ and W he an open suhdomain of 
G such that 

W C<ZG. 

Let rj £ G^{M.^) he the cut-off function of W, that is, ^ rj ^ 1, rj = 1 on W and r] ~ outside 
a small neighbourhood of W . Then there exists A > 0, such that for any ^ G R^ with \^\ > A, if we 
define the differential 1-form (p on M.^ hy 

0(x) = r/(a;)e-^l^-«l'(da;i + • • • + dx^), (3.2) 

and define the vector field Va on R^+^ (independent of x^'^^) hy 

Va^(^^\,\. a = 0,l,---d, (3.3) 

then the family of vector fields 

W,--- ,Va;Vo} 
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satisfies the Hbrmander's condition at every point on (supp </)) x M. In other words, the differential 
operator L on R^+^ defined by 



l=Ij:v^+vo 



(3.4) 



is hypoelliptic on {supp (f>)xW. 

Proof. For a differential f-form (f) on R^ defined by p.2p . define the vector fields 

on M^+^ by (|3.3p . Note that supp (p is independent of ^, which will be denoted by K. 
Let 

Oi = {f,2,...,4; 

62 - {(a,/3): a,/3 = 0,l,---,4; 

©n = {(«!, ■ ■ ■ , an) : ai = 0, f , ■ ■ ■ , f^, « = 1, 2, ■ ■ ■ "-}; 

CO 

e = |Je„. 

n=l 

For 6 = (01, • • • , On) G 6„, denote 16*1 = n, and we use the notation V[g] (respectively, Vjgj) to denote 

[Ve.dVe^r ■ ■ , [Ve„^i,VeJ]] (respectively, [Ve.dVe^, ■■■ , [Ve„^i,VeJ]])- 
We first prove that for any 9 ^ Q, V[g] can be written as 



vm 



9ie] + • V[6 



for some g^g] G C^(R^+^) independent of x^^-^. In fact, when 6 G 81, it is just the definition of V[g]. 
Assume that it is true for any 9 £ 8„. Let 6 G Qn+i, then there exists some ^ a ^ d and 6' E 6„, 
such that 

By the induction hypothesis, we have 



Vr. 



where 



b-Va J '\ g[g,] + (j) ■ V[g,] 

g[e] + <t> ■ y[e\ 



9[e\ - V^<?[e'i • Vo. + l/[J,j • V~</.^ • V^ - Vj ■ V^<^^ • V^g,] e C^ (M^+i), 
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which is independent of x^^^. Here V^ denotes the gradient operator with respect to x^^^ = 
{x^, ■ ■ ■ ,x^) and (•)-^ denotes the transpose operator. 

Now we are going to prove the result by a compactness argument. 

A key observation is that for any fixed 6 E Q, let g^g-^ S C^(K^+^) be such that 

as in the previous discussion, then g!g-\ is of the form 

9[e]{x) =P[e](C;a;)e-^ !"=-«!', 

where p^gj (^; x) is a polynomial in ^ = (^^ , • • • , C^) with C^ coefficients depending only on x^'^' . Here 
the degree oi Pis] is at most \0\ — 1. In other words, 

|e|-i 

J=0 \a\=j 

Fix xo = (4^\a;^+i) G ii'" x Ri, where 4^^ = (^r" ,a;^) e R^. By the hypoellipticity 
assumption on L and continuity, there exists a neighbourhood U C if° of x^ and 0*^^', • • • , 6'(^) e 0, 
such that for any a;'^-' € C/, 

{y[,a,](xW),...,V«)](^^^')} 
constitutes a basis of R^. It follows that for any x E U x R^, the family of vectors in R^+-'^ 

{Vl0W]{x)r-- ,V[e(iv)](a;)} 

generate an A^-dimensional subspace of R^+^. Let M = max{|6''^^)|, • • • , l^^^^j}. Again by the assump- 
tion on L and continuity, it is possible to choose 9 E Q with 16*1 > M, such that 

degree (p[e]) >M (3.5) 

in a neighbourhood Uq <Z U of Xq . 

Now we are going to show that there exists A > 0, such that when ^ E R^ with |^| > A. the vector 
field V[0] cannot be generated by 

so that 

dim Spanj^fgd)], • • • , V[g(N)-^,V[g]} = iV + 1 

in Uq xM}, which yields that L defined by 13.41 is hypoelliptic in C/q x R^ . 
To prove this, first notice that there exists A'(a;*^^') E Cj^{Uq), such that 

N 

y[,](xW) = ^y(xW)V-)i(^^''^)' ^^Uo. 

i=l 

If for some xi eUq xM} ^ such that 

y[e]{xi) E Span{V[g(i)](a;i),--- , V[e(jv)](xi)}, 
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then we must have 

N 



It follows from simple calculation that 

N 

9[e]{xi) ^^X\x[^^)g[gi,,]{xi). (3.6) 



On the other hand, by cancelling out the term e~2l^^~^l on both sides, the L.H.S. of the reduced 
equation is a polynomial in ^ with degree greater than M, while the R.H.S. of the reduced equation is 
a polynomial in ^ with degree at most M. Therefore, the equation cannot hold for sufficiently large £,. 
It follows from the compactness of Uq that we can choose some A > 0, such that when ^ e M^ with 

lel>A, 

N 
i=l 

Therefore, when |^| > A, equation p.6p cannot hold in C/q x M^, which proves the claim. 

The case when xo S dK x R^ is almost the same as the previous one. Here we should repalce Uo 
by Uq n K. We should also notice that on the boundary dK, although ry itself and its gradient V77 
are zero, but the Hessian of j] is nonzero, which allows us to repeat the argument before with slight 
modification. 

Finanlly, combining with the above local results and by the compactness of K, we are able to 
choose A > (depending on K), such that for any ^ £ R^ with |^| > A, the differential operator L is 
hypoelhptic on isT x R^. 

Now the proof is complete. D 

For z G Z^ and e > 0, by taking W — HI and G — V^ , we can construct a differential 1-form (/)| 
supported in G according to Lemma [3.21 (just take some fixed admissible ^ G R^ as in the lemma). 
By proceeding in the same way as in [5], we conclude that the Stratonovich's integral of 0J along the 
diffusion paths over the duration of visit of supp (j)% is nonzero with probability one. 

Now we are going to construct extended Stratonovich's signatures to trace the original diffusion 
paths by using these differential 1-forms 0|. 

We first define extended Stratonovich's signatures formally. 

For smooth differential forms ip^ ,■ ■ ■ , ^'^ on R^, the iterated Stratonovich's integral [f/'^, • • • , "0 
(0 ^ s < i ^ 1) defined inductively by 

[V'\---,^1m= / [^\---,/-^]..„V'(°rf^n), 



fcl 



\s,t 



' s<u<t 

where 

N 

is called an extended Stratonovich's signature of the diffusion process Xt- 

The following lemma allows us to use extended Stratonovich's signatures for our study. The case of 
Brownian motion was proved in [6] , but we can easily adopt the proof to the our case without changing 
anything (in fact, the proof does not rely on probabilistic features, but only on paths). Recall that Qi 
is the completion of the cr-algebra generated by the Stratonovich's signature of Xt over [0, 1]. 
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Lemma 3.2. If ip^,- ■ ■ ^ip^ are smooth differential 1-forms on M^ with compact supports, then 

is Qi-measurahle. 

Proof. See [6j, Lemma 1.3. □ 

For ?7i ^ 0, let 

Wm = {{Zo = (0, • •• ,0),2:i,-- • ,Zr,^) : Zi G Z^, Zi ^ Zi^i, i = 1,2,-- • ,to}. 

An element {zq, zi, • • • , Zm) G Wm is called an admissible word of length m + 1. For e > 0, define the 
t/]^-measurable random variable on fl : 

M^ = sup{m > : [^l^, 0^^, ■ ■ ■ , 0z„]o,i 7^ for some admissible word (zq, zi, • ' ' > -^m) € Wm}. 
It follows that Mjj ^ M^ ^ My for almost surely. For m ^ 0, (zq, zi, • • • , Zm) e W,n, define 

then {A'^.,^ ^ ^ ■. : to ^ 0, (zq, zi, • • • , z^) € Wm} are mutually disjoint tj^ -measurable sets whose 
union is the whole space J7. See [6j for a more detailed discussion. 
Let 

be the space of admissible words. For e > 0, define the mapping y^ : J7 ^ W, 

3^''(a;) = (zo,zi,- •• ,Zm), 
if (zq, Zi, • • • , z„j) is such that 

It follows that y^ is well-defined and tjj^ -measurable. Intuitively, y^ is the maximal admissible word 
such that the associated extended Stratonovich's signature is nonzero. It is natural to believe that y^ 
records a reasonable amount of information of the diffusion paths and as e — > 0, it might be possible 
to recover the diffusion paths. 

3.3 Completing the proof: a squeeze theorem for convergence in trajectory 

In Section 2, we defined piecewise linear trajectories (P.L.T.s), parametrization of a P.L.T., and in- 
troduced the concept of convergence in trajectory. In this section, we are going to show that if y^ 
is regarded as a P.L.T. in Z^, then by taking En — -, with probability one, e„ • 3^"^" converges in 
trajectory to (^t)osjt^i, which completes the proof of our main theorem. 

Recall that a P.L.T. T is essentially a finite sequence of points in M.^ (not necessarily all distinct). 

Definition 3.1. For a P.L.T. T, T^ is denoted as the new P.L.T. by removing the last point of T. 
Let 7i,72 be two P.L.T.s. 71 is called a sub-P.L.T. of 72 (denoted by 71 -< T2) if 71 is a subsequence 
ofTz. 
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By the convergence result and the construction of cff^ in the last two subsections, if we denote X^ 
(respectively, X^) as the associated P.L.T. of the piecewise linear path X^ (respectively, X^), then it 
is immediate that 

with probability one, and X^^ and A"^" both converges in trajectory to (-'ft)o!jt^i- Therefore, it is 
natural to claim a certain kind of squeeze theorem for convergence in trajectory so we may conclude 
that y^^ also converges in trajectory to {Xt)o^t^i with probability one. 

The following result is a squeeze theorem for convergence in trajectory we are looking for, which is 
sufficient for our use. 

Proposition 3.3. Assume that {7^ }, {7^ } dfe two sequence of P.L.T. s such that: 

(1) the first points of 7^ and 7^ are identical; 

(2) the last two points of 72" are identical (i = l,2j. 

Let al" he a parametrization of 71 (i = 1,2) such that the partition points in ai belong to the 
partition points in CTj ^^d, for any t < 1 in <tI"' , 

r/"^(i|ai")) = r2(")(il4"^). 

(This assumption implies that (7^ )^ -< (7^ ) -j Let {T*-"^} be a sequence of P.L.T. s such that 

and (7t)o<t^i be a continuous path in M.^ such that 

lim sup |7:(")(t|al"V 7*1-0, * = 1,2. 

Then we can choose a parametrization a^^^i of T^"' , such that 

lim sup |r("Hik^"^)-7t| =0. 

In particular, T"*-"' converges in trajectory to (7t)o^t!ji- 

Proof. For any e > 0, there exists uq > 0, such that for any n > no, 

sup \r^"\t\<jl"^)^jt\<e, * = 1,2. (3.7) 

On the other hand, it is obvious that we are able to construct a parametrization cr'"^ of T^"', such 
that: 

(1) the partition points of a" belong to the partition points in <t^"' , and for any t < 1 in <tI"' , 

r/"^(t|ai"^) = r("ntk*"^); 

(2) the partition points in ct^"-* belong to the partition points in (T2 , and for any t < 1 in a^"' , 

7-(«)(t|^(«))^7-2(")(i|^(")). 
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(n) 

Let tn be the largest time spot in al such that i„ < 1. Let m„ < v„ be any consective time spots 
in (T^"\ then on [m„,w„] both 7^ (-Ictj^" ) and T*-"-'(-|cr'^"^) are hnear. Therefore, by an elementary 
result on the comparison for linear paths, we have 

sup |r/"^(i|a("))-r(")(ik^"^)l (3.8) 

^max{|r/"^KIa(")) -r(")Kk("^)|, |r/"^K|ai"^) -r(")Kk("))|}. 

If Vn ^ tn , then 

It follows from (|X7)) and (|X5)l that 

sup |r/"Ht|ai")) - r(")(ik^"^)l < 2e, n> no- 

If Un ^ tn, since the last two points of 7^ are identical (denoted by a;*^")), it follows that 

sup |r/"\i|(Tf'Vr("'(ik("))|scmax{|:E(")-r("'K|(7(")), |a;(")-r(")Kk("))||}. 

Obviously 

But it may not be true for Vn since it is possible that w„ == 1. However, since 7"(") -< 7^ , there exists 
some Wn > Un, such that 

r(")Ki<T(")) = ri^^Kidf ^), k = vn if «„ < i). 

Due to the fact that 7^ = a;*^"-' on [i„, 1], we arrive again at 

sup \Tl"\t\a["'>) - r(")(ik^"^)l < 2£, n > no. 

Consequently, 

sup \Tl"\t\a^r^) - T^"\t\a^"^)\ < 2e, n> no. 

It follows that 

lim sup |r("Hik^"^)-7tl =0, 

and in particular, 7"'"' converges in trajectory to (7t)o^tsji- 

Now the proof is complete. D 

In order to apply Proposition [231 we are going to modify X"^ and choose a suitable parametrization 
based on the one for X^ specified in Subsection 3.1, which is chosen according to the successive visit 
time of larger boxes for the diffusion paths (excluding revisit of the same box before visiting other 
boxes) , so that the assumptions of Proposition 13.31 are all verified. 
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The method is the foUowing. By ushig the notation in Subsection 3.1, if (C^, t[ , Ck+i) is such that 

Cfc < T-f < Ck+1 ^ 1, 

then we modify the hnear path X'^ on [Cfc;Cfe+i] to a new path such that it does not move during 
[C^, rf ] and goes directly from its initial position at t = t[ to Xf^ a.t t ~ C^^-^ with constant velocity. 
Similarly, if 

Ck<r! <i<a+i, 

then we modify the linear path X^ on [(^, 1] (in fact, X'^ remains still on [C|, 1]) to a new path such 
that during [Cfi^f] and [rf,l] it remains still (equals Xfc). It seems that such modification is trivial 
and does not change anything, but it does make a slight difference if we are using the associated P.L.T.. 
Let X^ be the modified piecewise linear path of X^ and let X^ be the associated P.L.T. of X'^. If we 
can prove that X"^" converges uniformly to {Xt)o^t^i with probability one, then all the assumptions 
in Proposition 13 . 31 for the triple sequence {(A''^",e„ • 3^*^", A"^")} are verified, and we will complete the 
proof of Theorem 12. II In fact, it is just a simple modification of the arguments in Subsection 3.1. 

Lemma 3.3. With probability one, (Xj^")o^f^i converges uniformly to the diffusion paths {Xt)o<^tfii- 

Proof. Exactly the same as the proof of Proposition 13.11 This is because when we estimate 

P( sup \X^-Xt\>Xe, q^l) 

as we do for X^ in the proof of Proposition 13.11 we can still conclude that the event 

{ sup \XI~Xt\>Xe, q^l} 

implies that after Cf-i? the process must have travelled through many narrow tunnels and spreaded 
far away from the box V3e before visiting another box, and when looking carefully into the details 
of the proof of Proposition 13.11 we will see that the quantitative estimates (especially the choice of A) 
is exactly the same. The same is true when estimating 

P( sup \X^ - Xt\ > Ae, M^ = /). 

Therefore, we can carry through the whole proof of Proposition 13 . 1 1 to conclude that with probability 
one, (Xj^")o^t^i converges uniformly to the diffusion paths (^t)o^t<i. CH 



Now the proof of Theorem 12. II is complete. 
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